
Conversation


@mattf commented Aug 16, 2025

Implements optional idempotency for batch creation using the `idem_tok` parameter:

* **Core idempotency**: Same token + parameters returns existing batch
* **Conflict detection**: Same token + different parameters raises HTTP 409 ConflictError
* **Metadata order independence**: Different key ordering doesn't affect idempotency

**API changes:**
- Add optional `idem_tok` parameter to `create_batch()` method
- Enhanced API documentation with idempotency extensions

**Implementation:**
- Reference provider supports idempotent batch creation
- ConflictError for proper HTTP 409 status code mapping
- Comprehensive parameter validation

**Testing:**
- Unit tests: focused tests covering core scenarios with parametrized conflict detection
- Integration tests: tests validating real OpenAI client behavior

This enables client-side retry safety and prevents duplicate batch creation
when using the same idempotency token, following common REST API conventions for idempotency.

closes #3144
meta-cla bot added the **CLA Signed** label Aug 16, 2025

graphite-app bot commented Aug 16, 2025

How to use the Graphite Merge Queue

Add either label to this PR to merge it via the merge queue:

  • add-to-merge-queue - adds this PR to the back of the merge queue
  • hotfix - for urgent hot fixes, skip the queue and merge this PR next

You must have a Graphite account in order to use the merge queue. Sign up using this link.

An organization admin has enabled the Graphite Merge Queue in this repository.

Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.


mattf commented Aug 16, 2025

@franciscojavierarceo ptal

```python
    endpoint: str,
    completion_window: Literal["24h"],
    metadata: dict[str, str] | None = None,
    idem_tok: str | None = None,  # intentionally bad name
```
Contributor

what does this comment mean?

Collaborator

`idem_tok` is an intentionally bad name.

Collaborator @franciscojavierarceo commented Aug 19, 2025

fwiw, i would have just called it `idempotency_key`; was the choice here because that's too verbose?

Collaborator Author

exactly this. the choice was to get feedback.

it's a non-standard param. chances are other APIs will follow the lead here.

`idem_tok` -> `idempotency_key`?

Collaborator

i am partial to `idempotency_key` because I actually had to confirm that `idempotency_token` was equivalent to it (which it is, according to lots of google hits), so I imagine it's just convention exposure and key was exposed to me first. token can be an unfortunately loaded term in our ai land fwiw.

```python
# allowing us to detect parameter conflicts
if idem_tok is not None:
    hash_input = idem_tok.encode("utf-8")
    hash_digest = hashlib.sha256(hash_input).hexdigest()[:24]
```
Contributor

The default batch id uses a 16-char hex section; is there a reason to use a different length here?

Collaborator

+1

Collaborator Author

secret way to tell the difference. happy to align them.

Collaborator

oh that's fine then. i personally like to add a prefix for those reasons (OpenAI follows this, though they don't expose an idempotency key), but a different size is okay.

```python
# For idempotent requests, use the idempotency token for the batch ID
# This ensures the same token always maps to the same batch ID,
# allowing us to detect parameter conflicts
if idem_tok is not None:
```
Collaborator

I assumed we would generate the idempotency key on the server based on the concatenation of input metadata (similar to how we generate `chunk_id`) and then store that in the DB, rather than expose it in the API.

could you elaborate on the rationale for having it defined by the user of the client?

Collaborator Author

openai's /v1/batches allows for creating duplicate batches. if we don't expose something and de-dup server side, we'll break semantics.

another approach i considered was a user-passed `allow_duplicates`, which defaulted to true.

but the common approach for this functionality is a user-controlled key.

Contributor

idempotency is always a client-side concept. the client is telling the server the unique context under which the request was generated so that the server can dedupe things correctly.

Collaborator @franciscojavierarceo commented Aug 21, 2025

Yes you both are right of course. 🤦🏻

I forgot that this should be generated by the client.

https://medium.com/@sahintalha1/the-way-psps-such-as-paypal-stripe-and-adyen-prevent-duplicate-payment-idempotency-keys-615845c185bf

Collaborator @franciscojavierarceo left a comment

lgtm, some small nits

@mattf requested a review from yanxi0830 as a code owner August 19, 2025 20:34


```python
@pytest.fixture
async def provider():
```
Contributor

this is a rather non-specific name for a fixture which isn't actually that general. can you rename this `batches_provider`?

Contributor

I am merging this anyhow as I prepare for a release.

@ashwinb merged commit cffc4ed into llamastack:main Aug 22, 2025
22 checks passed
franciscojavierarceo pushed a commit to franciscojavierarceo/llama-stack that referenced this pull request Aug 25, 2025